Multichannel Speech Enhancement

نویسندگان

  • Lino García
  • Soledad Torres-Guijarro
چکیده

1.1 Adaptive Filtering Review There are a number of possible degradations that can be found in a speech recording and that can affect its quality. On one hand, the signal arriving the microphone usually incorporates multiple sources: the desired signal plus other unwanted signals generally termed as noise. On the other hand, there are different sources of distortion that can reduce the clarity of the desired signal: amplitude distortion caused by the electronics; frequency distortion caused by either the electronics or the acoustic environment; and time-domain distortion due to reflection and reverberation in the acoustic environment. Adaptive filters have traditionally found a field of application in noise and reverberation reduction, thanks to their ability to cope with changes in the signals or the sound propagation conditions in the room where the recording takes place. This chapter is an advanced tutorial about multichannel adaptive filtering techniques suitable for speech enhancement in multiple input multiple output (MIMO) very long impulse responses. Single channel adaptive filtering can be seen as a particular case of the more complex and general multichannel adaptive filtering. The different adaptive filtering techniques are presented in a common foundation. Figure 1 shows an example of the most general MIMO acoustical scenario.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Time Delay Compensation for Adaptive Multichannel Speech Enhancement Systems

Several algorithms for adaptive multichannel speech enhancement have been tested in an office room and in an anechoic chamber using different noise sources. Temporal synchronisation requires time delay estimation and compensation of the desired speech signal received at different sensors.

متن کامل

Student-Teacher Learning for BLSTM Mask-based Speech Enhancement

Spectral mask estimation using bidirectional long short-term memory (BLSTM) neural networks has been widely used in various speech enhancement applications, and it has achieved great success when it is applied to multichannel enhancement techniques with a mask-based beamformer. However, when these masks are used for single channel speech enhancement they severely distort the speech signal and m...

متن کامل

Beta-order minimum mean-square error multichannel spectral amplitude estimation for speech enhancement

In this paper, the minimum mean-square error (MMSE) ˇ-order estimator for multichannel speech enhancement is proposed. The estimator is an extension of the single-channel MMSE ˇ-order and multichannel MMSE short-time spectral amplitude estimators using Rayleigh and Gaussian distributions for the statistical models under the assumption of a diffuse noise field where the noise is estimated indepe...

متن کامل

A Multichannel Feature Compensation Approach for Robust ASR in Noisy and Reverberant Environments

In this paper we propose a multichannel feature compensation approach for automatic speech recognition in reverberant and noisy environments. The proposed technique propagates the posterior of the clean signal estimated by a multichannel Wiener filter in short-time Fourier transform (STFT) domain into Mel-frequency cepstrum coefficients (MFCC) domain. The multichannel Wiener filter reduces both...

متن کامل

A Multi-Microphone Speech Enhancement Algorithm Tested Using Acoustic Vector Sensors

In this paper, we present a speech enhancement algorithm for multi-microphone systems that enhances a target signal in noisy multi-talker situations. We apply the general multichannel Wiener filtering framework, for which we have developed a new technique to directly estimate the auto-correlation of the target signal assuming its direction is known. The advantage of our approach compared to tra...

متن کامل

Example-based speech enhancement with joint utilization of spatial, spectral & temporal cues of speech and noise

This paper proposes a multichannel speech enhancement technique that leverages three essential cues embedded in the observed signal, i.e., spatial, spectral and temporal cues, for differentiating target clean speech components from noise. The proposed method estimates clean speech and noise features using a single optimization criterion by integrating two approaches, namely, exampleand model-ba...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008